ced-9 genomic 930608 Sequence



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



ClaI XhoI

BspDI SpeI NcoI PaeR7I EarI

▼ ▼ ▼ ▼ ▼

ATCGATAGTC GTCACCAAAT GffiTTITCCG ATTTCTCACT AGTCCATGGC TCACAATTTA CAAAATCTCG AGAAAAGAAA GGATGCAAGG AGTATGAAGA 100 



SspI DraI BstBI
▼ ▼ ▼ 

GGTTCCGAAT CTAAATATTT TAATTTAAAA AAATCAATTT 0GMTTGAM TTCAACTCCT ACTCGTTITG AAAATGCCAA TCCTTTAAGT AAACTTCTGG 200 



BstBI 
▼ 

ATQXTCATT TCTTCCAGAA ATTCQTCAA AGTAGTGGIT TIGTACTGAT TTCCTCCGCA AAGAATAGGA AOTTCGAAT CTCCTGGAGC GAAAOGGGAT 300 



SspI
▼ 

TTTSATAACA AAAMCTATC CAGACAAACC ATAGGACTTT TTCAAATATT CCTTATTTGG CIGTCCATTT GGAAGCACCC AATCTTTAAC GCTGTCCAGC 400 



NcoI

▼ 

CAGAAGTGCT CCACTCGCCA AGGATAAAAG GCTCATTTTT GAAGCCGAAT TTTACTAAAA TCTCTAGCCA TGGAGTCGAT GGATCAGAAA TTCGAGGAAT 500 



TTTAGATTTC ATCTTGAAAT TTGCAATGGA AAAAATAATT ATTCAAAGAA AATCACAGAA AATGCAACAA AAAAAACAAA AAAAGAACAA AAAACAAGTC 600 



SmaI EarI
XmaI Esp3I



GAAAAGTGCG CCCGGGTCGT TTGCTGACGC ATCTCITCAA ACGAGACGCG CIGOGGCGC ACTTCTCGTG CCCTGTGCGT GCATTTCCGC AACAAAATTC 700 



Fig. 2A 



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



AACACTTGTT TTGAMCGCA CCGCCCTGTT TCTITrTTCA ATTTTGATM GAAAATCAGC ATTGTTTCAG GATGATTAAC ATTCCAACTG 0GATTCTGTG 800 

PvuI

NarI ClaI
KasI BspDI

COGOTgS GCCAGATCGT CGATTTCCCG CTCCnTGGA ACATOGATCG TCACCAAGGT GGGGAnTFT TGAATTTTTC CGTGAAAATT GTTGATTTTT 900 



AseI SspI

TGTGTACGCA TGAAGGAGAA ATGTATAACA GACACATTCT TITCAATTAA TTATTTATAA TATTCACAGT CCGAGGCAAA GACGCCAATC CAGAAGTTCG 1000 



BspMII 

Eco47III MunI BspEI

▼ ▼ ▼ 

GATGGGAATA (TTGTTGAAG OGCGCTCCA AGAATCGCCC MTCGCTCCA CATCTCACCG TCTACCAGCC ACAATTGACC TGGATGCTCT CCGGATTCCA 1100 



TAGAATCAGC GGTTGTGTAA 1GGCCGGAAC COTCTCGTC GGAGGAATCG GATTCGCAGT TTTGCCGTTC GATTTCACOG C1TITGTGGA TTTCATCOGT 1200 

BbsI EcoRI
▼ ▼ 

AGCTGGAACT TACCATGCGC GGTGACCGCT GTOTCAAGT ACATCATTGC TTTCCCCATC ATOTCCATA CTCTMCGG AATTCGCTTC TTAGGATTCG 1300 



AseI DraI AseI BglII

▼ ▼ ▼ ▼ 

ATTTGGCTAA GGGAGTCAAT AATGTTGGAC AGGTAGGAGT TGAAATTATT AATTTAATTG TTITAAAATA AAAATTAATT TTCAGATCTA CAAATCGGGA 1400 



Fig. 2B 



10 20 30 40 50 60 70 80 90 100 
EcoRV BbsI AvrII

▼ ▼ ▼ 

TATCTCGTAT CTCGAOTTC GGCTATTCTT GCTCTCGCCA TTGTOTCAA CTCTTGCCAG AACAAGAGCA ACAAGACTGC CTAGGCACAG ATGCTCCGCC 1500 



TimTTTTC TTACTCCGCC CCAGCCCTCG ACAATTCTCG TCAATTTACT TITACCGTTG ATTTCITCGA TTCTCTCICT TTTCCGTAGA TTTACCTCTC 1600 



EarI XbaI
▼ ▼ 

crcrrcGTrr TrmTcrcr gtctagaatg tatattatga ttatgaaaac gaataaaaat tttagatgac acgctgcacg gcggacaact cgctgacgaa 1700 



TCOGGCGTAT CGGCGACGAA CGATGGCGAC TGGCGAGATG AAGGAGTTTC TGGGGATAAA AGGCACAGAG CCCACOGATT TTGGAATCAA TAGTGATGCT 1800 



MunI EarI
▼ ▼ 

CAGGACTTGC CATCACCGAG TAGGCAGGCT TCGACGCGAA GAATGTCCAT CGGAGAGTCA ATTGATGGAA AAATCAATGA TTGGGAAGAG CCAAGGCTTG 1900 



EcoRV SalI
▼ ▼ 

ATATCGAGGG ATTTGTGGTA ATTmTAAT TITITITIGT AAATAAAATT TCCTGCTGCT TCCAGGTCGA CTATITCACG CACCGAATCC GGCAAAACGG 2000 

Fig. 2C 



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



AATGGAATGG TTTGGAGCAC OGGGATTGCC GTGTGGAGTG CAACCGGAGC ACGAAATGAT GCGAGTTATG GGAACGATAT TCGAGAAGAA GCACGCGGAA 2100 
BsaI PvuII

MTfflGAGA CCTTCTGTGA GfflGCTGCTC GCAGTGCCCA GAATCTCATT TTCACTGTAT CAGGA1GTGG TTCGGACGGT TGGAAATGCA CAGACAGATC 2200 

BstBI 
▼ 

AATGTCCAAT GTCITATGGA CGTITGGTAA GGGAGAAAAT ACTGAAAAAA AGTITGCAAA MTTCGAAAA TTCGCCAGAA AGGTGGCAGA AAAAACATTT 2300 



GCAAAAATTG TnGTTTTCC TTCAGGAAAT CAGCAAAACT TGGTCAAAAA TAGCCCAATT ATGTGTCTTT TTTGAAAG1T TTCCATTAAA AAACCACGAA 2400 



SspI

EcoRI BstBI DraI
▼ ▼ ▼ ▼ 

TTTTGATCCC GGATTGTAAT TITnTTGTT GATAAATTAG CAGAAAACIT TACGAATTCG ATTAAAAACG TTATTTTCTA TTCGAATATT TTTAAAGCAT 2500 



BglII DraI

▼ ▼ 

ATITTCCTTG ATTTGTATTT GCGAAAAAGA TCTGCTGATT TATCAAAAAT CGGTnTTAA ATGTAAAATT TGTGGAAAAT ACATTAAAAT TCGATITTTG 2600 



BstBI BstBI 
▼ ▼ 

AACnTITTC TTCGAAAAAC AGGTfflTO GCIGATTTGC TGAACGAAAA ACCCCAAAAA TTCAATTITC GAACATTAAA AACCAGAAAA ATQ7]TITIT 2700 



Fig. 2D 
HindIII

TAMCTTAAT TITCCGCCAG AAATGAACGA ATTAAATTGC MffiTCTM TTTTCAGATA GGTCTAATCT CGTTCGGOGG TITCGTAGCT GCAAAAATGA 2800 



PstI EarI BamHI

TGGAATCCGT GGMCTGCAG GGACAAGTGC GAAACCTCTT CGTTTACACA TCGCFGTTCA T(mCO!G^ATCCGCAAC AACTGGAAGG AACACAATOG 2900 



SmaI
XmaI
▼▼ 

GAGCTGGGTA AGGAGTATTT GCATAGACAT TAGAAGTCAA TATCCCCCTT TCCCTAGTAC CCTTGACTTC (XGGGGTGTT GGTAAGCCGA TAATTACAGG 3000 



PvuII BsmI

▼ ▼

GTTCGGTAGC CTCTTGGGGG GACAGCTGGA AACATATTCA AGTATATTAC TGTTTATGAT AATGTTATTG TTACGGGAAT ACAAAATTCG CAGAATGCTA 3100 



DraI DraI
▼ ▼ 

TTTCACAACA TATTTGACGC GCAAAATATC CAGTAGAGAA MCTACAGTA ATTCITTAAA TTITTAAAAT TnTACAATT AAAGAAAATA ACCACTAATC 3200 



AseI DraI

▼ ▼

AAAAGAAATT MTTTCAAAA ATCGAGCCCG TAAATCGACT ACAGTAGGCA TTTAAAGAAT TACTGTAGTT TTCGCTAOGA GATATTTCCG CCTCAAATAT 3300 



BsmI
▼ 

GTTGTGAAAT ACGCATTCAC GGATnTTGT GTTCCCCGGA ATATGCTCTA AAGCATTATT TGTGAAAATA AAAAATCAAG AAAAAAATTG CAGGACGACT 3400 



Fig. 2E 



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



BspHI 

TCATGACACT OGGAAAACAA ATGAAAGAGG ACTA0GAA0G AGCAGAAGCT GAAAAAGTGG GACGCOGGAA GCAGAACAGA 0GGTGGTCGA TGATTOC 3500 



PvuII AseI
TGGAGTAACA OTGGAGCCA TTGGAATCGT TGGAGTCGTC GTGTGTGGGC GGATGATGTT CAGOTCAAG TAACGTATTC AATITGTGTA AATAATCAAT 3600 



TEATGTACAA CTCCTTACAT TTGAATCTCA TITITGCTCA CTGATTCTCT CATCCTTTGA ACTGGAAGAA GTGGGAAAGC TAGGCCACAA ATTACGGCTC 3700 

MscI 

TCTGTGTOGA TTTACGATTT TACTGCAATT TnTCCGATT GCCTITITIT TTGGCCAAAC CCTACITCCG CGTAATATCA ACnTTCCGT GTTCTGTACA 3800 



EarI

TTT0GTCAAA AACCCTGAAA CCCTMCTTT TCTOGCCGTG GCCTAGCCTC CCGCTTCTCT TCfflCATITC CAAAGTACCC CTGTATCTCA ATAATTCATC 3900 

SplI 
BsiWI 

EarI MluI

▼ ▼ ▼ 

TTCACTTTAA CrGTCTCnT TOGTGTGGCC TdTCCAACT CCCCCCAAAT TCCTGTAOGC GTACGOGACT TTGTATITAT TTTTTTCAAA TTCTnTCrC 4000 



TCTACAACAA CAAAAAAAAC GOTOTTTA TTCAACCCIT TTTTCGGAAC GAAACTGCAA TTTTGATAAT AGGCGTGOGC AAGAGAATCC GGTrTTCATT 4100 



Fig. 2F 



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 

XhoI
PaeR7I
Esp3I
EarI

▼ ▼ 

TTCGCCATCA CGTCATCCAA AAAAGTTTAG TAGGAAAATA TCATTTnTA ATATAATGAT TCATCTTTCT GGCCTCTICT GTCTCGAGAC GACGGTCAAT 4200 



BstBI 
▼ 

TCGATGGCCT TGAATmTC GAAAACAAAA ATGTTTTTGT TTAGTGTAAA CGATCCCCCC GCCTTATOGC TGTTTCACCA TCAGATAGGC TC0GCCATTT 4300 



ApaLI 

▼ 

GATTCCCTTG MTTITGTCG GTATATAAAA CAAAAAACGT TAGTGCACGA TTCAAAAAAC AACAATGCGT GCTTTACTAT TCACCTCTGT TGTTCTnTG 4400 



EarI EcoRI
▼ ▼ 

GCITOGCIT TTGTTGAGGC AAAGAAGCAG ACTATCACTGt TCAAGGGTAC AACTATTTGT AATAAGAAGA GAATTCAGGH GRAGGTTACC TITGGGAGAA 4500 

StuI

▼ 

AGATACTCGT GAGTTCTCAG TCTTGTTTAG CTTGAAACGG CTTAAAAAGG ACIAAAAAGG CCTAAAAATT GAAGTTTTCC ACCTGTTTTC AAAAGAAAGC 4600 



CGAATTGCAC AGOTTACAC GAGATTTCK; AATMTTTGT ATTTGAAATT TTCATATTCA TCCCCAAACG TKTITACAC GAAATTTTGC GATTTTTGAG 4700 

BsaI DraI
▼ ▼ 

CTTAAAATAC GATACCTGGT CTCGACACGA AACATTITTG TTAAATTCAA AAAGATGTGC GCdTTAAAG AGTGCTGTAG TTTGAAACTT 



Fig. 2G 

10 20 30 40 50 60 70 80 90 100

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



ClaI

BspDI NruI

▼ ▼ 

GfflCinTCA TCGOTTTTC GTAGCGTTIT TTIATAAGAA AAATGTATIT ATTTATICAA MMTTMTT 1TACCGAATC GOGAAAAACA AAATGMGAA 4900 

SacI
MscI
BslI
NarI

ScaI BsaI KasI ApaI XmaI



CACCGATAAA AATATCGCAG CAACAATAGT TTGAAA1TAC AGTACTdTT TAAGGNGNNC ACATTTCCTA TATTTCACAC AAACTFGTCG TGT0GWCN 5000 



SnaI HindIII
▼ ▼

GGGTATCGTC ATTITGATGC AGAAATCAAG AAAATTGCAT ATATGTTCAA AAAACCACAA TTATGGCGAA TTTCAAGCTT GAAACGAAAA TTCAQGAAAT 5100 



BstBI NdeI

▼ ▼ 

TCTAAAMTT AAAAAAAAAT CATTCGAAAT GTGAAATTTG ATATTCAACT TGAAGTCCAT ATGC3CAMTT TOGTCTATTC CGNNMTCGA NNATCTTGrr 5200 

MI 

CCACCTGGCC GCGAAAAGAG AAAGCA0GAN NACTGATCTC TGGCAATTTT TTCCTGTACC GTGTCAA1TA TTTGAAACTC TAATAAGCTG GTATinTCT 5300 



SspI
▼ 

GCTATTGACA ACTAACTGAA TCCATAMTT GCAATTATAA TATTGACTTr TGATGTGTGG OTAGAAAAA AAAAACCAAA AACCTCATCT AGCTTTAGGC 5400 



AvrII SalI

▼ ▼ 

TGCCAATATA TTCCTAGGAC ATATAAAAAA CCCITAAAAT TCTCTGCAAC ACCTACAAGC TATCAAACGT ACTATTAGTA TTCAATTITC CAGTCGACCC 5500 



Fig. 2H 



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 



BamHI

CGATGACMG CTCGCCTCM TGCAATCGAA CAAAGAAGGA GAGTTCTCAC TTACCGGATC CGACGACGAG ATQCCTCAA TCTCTCCATA CCTCATAATC 5600 



ACCCACAACT GCftAQGTGAA GAAGGCOGGA TGCAAGCGTG TTTCAGAGTA TTTGATTCCA AAGGAGAAGA TCGGTGGAAC CTATGATATG ACATACGTCA 5700 



CTCITGATAT TCTTTCOGCT AAAGACAAGG AGAAGTGCTA AGAAAATGTT TnTTTGnT GGTTTGCTTG TITGGAAGGG AAGGACTTTC TATCTCmT 5800 



AATICAACAA TAAACTATTG GAAAACOGTT GAAATTTTAA CCTTGAACTG TAAGAAAAGT TGCGTGATTA TGTTGACAAT TTTGCCAAGT ATATCITTGT 5900 



EcoRV SspI AseI BsmI

▼ ▼ ▼ ▼

GGATATCACA ATAAACGAAG TCAAAGCACG AAATATTACG GAAACACAAA ATTAATGAGA ATGCGCAACA TATTIGACCG CAAAATATCT CGTAGCGAAA 6000 



Eco47III SacI SspI

CTACAGTAAT TCTTCAAAAG ACTACTGTAG QGCTGTGTCG ATTTACGAGC TffiATnTTG AAATGAATCA GACTAGAAGA AAAGGAGGAA AATATTGAAC 6100 



MunI BbsI
ATCMTTGAA CATCMTTCA AAAAGTCGAA CCCITCACTA CAGTAGTCTT CTAAAGAATT ACTGTAGTTT TCGCTACGAG ATATTTTGNG NGTCAAATAT 6200 



Fig. 2I
GTTGNGCMT ACGCATCCTC AGAA1TGTGT GITCTCGTM TGTOTGAAA ATTTTCCATT TCAACATCAA ATMGCAAAT CTAAAAATGT GGGTTCTGCA 6300



PstI DraI

GCGACCACTA TGACTGTGAT CGTGGCAAGA CCCACTCAGA AAACTA0GTG TTCCITTAAA CAMTACATT 1TTMGTATT GTAGGTATAA AAATTGTTGG 6400 



NheI SalI BbsI HindIII

CTAGCAGTCT AGGCTGCCTT TTTCAOTCGA CAAACTTCTA ATTTAAT0GG OGGCTCrTCA AAAAGTCGTT TCITTGAAM TATAMGCIT TATATATTTA 6500 



EcoRV SpeI

TATATTAAAA ATTTTGATTA CATGATATCA AAAGOGACTA GTTTGTATAA AAATTATCAA 6560 



Fig. 2J 
1


GCGCCCGCCC 


CTCCGCGCCG 


CCTGCCCGCC 


CGCCCGCCGC 


GCTCCCGCCC 


51 


GCCGCTCTCC 


GTGGCCCCGC 


CGCGCTGCCG 


CCGCCGCCGC 


TGCCAGCGAA 


101 


GGTGCCGGGG 


CTCCGGGCCC 


TCCCTGCCGG 


CGGCCGTCAG 


CGCTCGGAGC 


151 


GAACTGCGCG 


ACGGGAGGTC 


CGGGAGGCGA 


CCGTAGTCGC 


GCCGCCGCGC 


201 


AGGACCAGGA 


GGAGGAGAAA 


GGGTGCGCAG 


CCCGGAGGCG 


GGGTGCGCCG 


251 


GTGGGGTGCA 


GCGGAAGAGG 


GGGTCCAGGG 


GGGAGAACTT 


CGTAGCAGTC 


301 


ATCCTTTTTA 


GGAAAAGAGG 


GAAAAAATAA 


AACCCTCCCC 


CACCACCTCC 


351 


TTCTCCCCAC 


CCCTCGCCGC 


ACCACACACA 


GCGCGGGCTT 


CTAGCGCTCG 


401 


GCACCGGCGG 


GCCAGGCGCG 


TCCTGCCTTC 


ATTTATCCAG 


CAGCTTTTCG 


451 


GAAAATGCAT 


TTGCTGTTCG 


GAGTTTAATC 


AGAAGACGAT 


TCCTGCCTCC 
501


GTCCCCGGCT 


CCTTCATCGT 


CCCATCTCCC 


CTGTCTCTCT 


CCTGGGGAGG 


551 


CGTGAAGCGG 


TCCCGTGGAT 


AGAGATTCAT 


GCCTGTGTCC 


GCGCGTGTGT 


601 


GCGCGCGTAT 


AAATTGCCGA 


GAAGGGGAAA 


ACATCACAGG 


ACTTCTGCGA 


651 


ATACCGGACT 


GAAAATTGTA 


ATTCATCTGC 


CGCCGCCGCT 


GCCAAAAAAA 


701 


AACTCGAGCT 


CTTGAGATCT 


CCGGTTGGGA 


TTCCTGCGGA 


TTGACATTTC 


751 


TGTGAAGCAG 


AAGTCTGGGA 


ATCGATCTGG 


AAATCCTCCT 


AATTTTTACT 


801 


CCCTCTCCCC 


CCGACTCCTG 


ATTCATTGGG 


AAGTTTCAAA 


TCAGCTATAA 


851 


CTGGAGAGTG 


CTGAAGATTG 


ATGGGATCGT 


TGCCTTATGC 


ATTTGTTTTG 


901 


GTTTTACAAA 


AAGGAAACTT 


GACAGAGGAT 


CATGCTGTAC 


TTAAAAAATA 


951 


CAAGTAAGTC 


TCGCACAGGA 


AATTGGTTTA 


ATGTAACTTT 


CAATGGAAAC 


1001 


CTTTGAGATT 


TTTTACTTAA 


AGTGCATTCG 


AGTAAATTTA 


ATTTCCAGGC 


1051 


AGCTTAATAC 


ATTGTTTTTA 


GCCGTGTTAC 


TTGTAGTGTG 


TATGCCCTGC 


1101 


TTTCACTCAG 


TGTGTACAGG 


GAAACGCACC 


TGATTTTTTA 


CTTATTAGTT 


1151 


TGTTTTTTCT 


TTAACCTTTC 


AGCATCACAG 


AGGAAGTAGA 


CTGATATTAA 


1201 


CAATACTTAC 


TAATAATAAC 


GTGCCTCATG 


AAATAAAGAT 


CCGAAAGGAA 


1251 


TTGGAATAAA 


AATTTCCTGC 


GTCTCATGCC 


AAGAGGGAAA 


CACCAGAATC 


1301 


AAGTGTTCCG 


CGTGATTGAA 


GACACCCCCT 


CGTCCAAGAA 


TGCAAAGCAC 


1351 


ATCCAATAAA


ATAGCTGGAT 


TATAACTCCT 


CTTCTTTCTC 


TGGGGGCCGT 


1401 


GGGGTGGGAG 


CTGGGGCGAG 


AGGTGCCGTT 


GGCCCCCGTT 


GCTTTTCCTC 


1451 


TGGGAAGGAT 


GGCGCACGCT 


GGGAGAACGG 


GGTACGACAA 


CCGGGAGATA 


1501 


GTGATGAAGT 


ACATCCATTA 


TAAGCTGTCG 


CAGAGGGGCT 


ACGAGTGGGA 


1551 


TGCGGGAGAT 


GTGGGCGCCG 


CGCCCCCGGG 


GGCCGCCCCC 


GCACCGGGCA 


1601


TCTTCTCCTC 


CCAGCCCGGG 


CACACGCCCC 






1651 


CCGGTCGCCA 


GGACCTCGCC 


GCTGCAGACC 


CCGGCTGCCC 


CCGGCGCCGC 


1701 


CGCGGGGCCT 


GCGCTCAGCC 


CGGTGCCACC 


TGTGGTCCAC 


CTGGCCCTCC 


1751 


GCCAAGCCGG 


CGACGACTTC 


TCCCGCCGCT 


ACCGCGGCGA 


CTTCGCCGAG 


1801 


ATGTCCAGCC 


AGCTGCACCT 


GACGCCCTTC 


ACCGCGCGGG 


GACGCTTTGC 


1851 


CACGGTGGTG 


GAGGAGCTCT 


TCAGGGACGG 


GGTGAACTGG 


GGGAGGATTG 


1901 


TGGCCTTCTT 


TGAGTTCGGT 


GGGGTCATGT 


GTGTGGAGAG 


CGTCAACCGG 



Fig. 7B 
1951


GAGATGTCGC


CCCTGGTGGA 


CAACATCGCC 


CTGTGGATGA 


CTGAGTACCT 


2001 


GAACCGGCAC 


CTGCACACCT 


GGATCCAGGA


TAACGGAGGC 


TGGGATGCCT 


2051 


TTGTGGAACT 


GTACGGCCCC 


AGCATGCGGC 


CTCTGTTTGA 


TTTCTCCTGG 


2101 


CTGTCTCTGA 


AGACTCTGCT 


CAGTTTGGCC 


CTGGTGGGAG 


CTTGCATCAC 


2151 


CCTGGGTGCC 


TATCTGAGCC 


ACAAGTGAAG 


TCAACATGCC 


TGCCCCAAAC 


2201 


AAATATGCAA


AAGGTTCACT 


AAAGCAGTAG 


AAATAATATG 


CATTGTCAGT 


2251 


GATGTACCAT


GAAACAAAGC 


TGCAGGCTGT 


TTAAGAAAAA 


ATAACACACA 


2301 


TATAAACATC


ACACACACAG 


ACAGACACAC


ACACACACAA 


CAATTAACAG 


2351 


TCTTCAGGCA 


AAACGTCGAA


TCAGCTATTT 


ACTGCCAAAG 


GGAAATATCA


2401 


rnirirn ^ irirTirprprprp 
A X J. XI X X X X X X 


ACATTATTAA 


GAAAAAAGAT 


TTATTTATTT 


AAGACAGTCC 


2451 


PATPAAAAPT 


PPGTPTTTGG 


AAATCCGACC 


ACTAATTGCC 


AAACACCGCT 


2501 


TCGTGTGGCT 


CCACCTGGAT 


GTTCTGTGCC 


TGTAAACATA 


GATTCGCTTT 


2551 


CCATGTTGTT 


GGCCGGATCA 


CCATCTGAAG 


AGCAGACGGA 


TGGAAAAAGG 


2601


ACCTGATCAT 


TGGGGAAGCT 


GGCTTTCTGG 


CTGCTGGAGG 


CTGGGGAGAA 


2651


GGTGTTCATT 


CACTTGCATT 


TCTTTGCCCT 


GGGGGCGTGA 


TATTAACAGA 


2701 


GGGAGGGTTC 


CCGTGGGGGG 


AAGTCCATGC 


CTCCCTGGCC 


TGAAGAAGAG 


2751 


ACTCTTTGCA 


TATGACTCAC 


ATGATGCATA 


CCTGGTGGGA 


GGAAAAGAGT 


2801 


TGGGAACTTC 


AGATGGACCT 


AGTACCCACT 


GAGATTTCCA 


CGCCGAAGGA 


2851


CAGCGATGGG 


AAAAATGCCC 


TTAAATCATA


GGAAAGTATT 


TTTTTAAGCT 


2901


ACCAATTGTG 


CCGAGAAAAG


CATTTTAGCA 


ATTTATACAA 


TATCATCCAG


2951


TACCTTAAAC 


CCTGATTGTG 


TATATTCATA


TATTTTGGAT


ACGCACCCCC 


3001


CAACTCCCAA 


TACTGGCTCT 


GTCTGAGTAA 


GAAACAGAAT 


CCTCTGGAAC 


3051 


TTGAGGAAGT 


GAACATTTCG 


GTGACTTCCG 


ATCAGGAAGG 


CTAGAGTTAC 


3101 


CCAGAGCATC 


AGGCCGCCAC 


AAGTGCCTGC 


TTTTAGGAGA 


CCGAAGTCCG 


3151 


CAGAACCTAC 


CTGTGTCCCA 


GCTTGGAGGC 


CTGGTCCTGG 


AACTGAGCCG 


3201 


GGCCCTCACT 


GGCCTCCTCC 


AGGGATGATC 


AACAGGGTAG 


TGTGGTCTCC 


3251 


GAATGTCTGG 


AAGCTGATGG 


ATGGAGCTCA 


GAATTCCACT 


GTCAAGAAAG 


3301 


AGCAGTAGAG 


GGGTGTGGCT 


GGGCCTGTCA 


CCCTGGGGCC 


CTCCAGGTAG 


3351 


GCCCGTTTTC 


ACGTGGAGCA 


TAGGAGCCAC 


GACCCTTCTT 


AAGACATGTA 
3401


TCACTGTAGA 


GGGAAGGAAC 


AGAGGCCCTG 


GGCCTTCCTA 


TCAGAAGGAC 


3451 


ATGGTGAAGG 


CTGGGAACGT 


GAGGAGAGGC 


AATGGCCACG 


GCCCATTTTG 


3501 


GCTGTAGCAC 


ATGGCACGTT 


GGCTGTGTGG 


CCTTGGCCAC 


CTGTGAGTTT 


3551 


AAAGCAAGGC 


TTTAAATGAC 


TTTGGAGAGG 


GTCACAAATC 


CTAAAAGAAG


3601 


CATTGAAGTG 


AGGTGTCATG 


GATTAATTGA 


CCCCTGTCTA 


TGGAATTACA 


3651 


TGTAAAACAT 


TATCTTGTCA 


CTGTAGTTTG 


GTTTTATTTG 


AAAACCTGAC 


3701 


AAAAAAAAAG 


TTCCAGGTGT 


GGAATATGGG 


GGTTATCTGT 


ACATCCTGGG 


3751 


GCATTAAAAA 


AAAATCAATG


GTGGGGAACT 


ATAAAGAAGT 


AACAAAAGAA 


3801 


GTGACATCTT 


CAGCAAATAA 


ACTAGGAAAT 


TTTTTTTTCT 


TCCAGTTTAG 


3851 


AATCAGCCTT 


GAAACATTGA 


TGGAATAACT 


CTGTGGCATT 


ATTGCATTAT 


3901 


ATACCATTTA 


TCTGTATTAA 


CTTTGGAATG 


TACTCTGTTC 


AATGTTTAAT 


3951 


GCTGTGGTTG 


ATATTTCGAA 


AGCTGCTTTA 


AAAAAATACA 


TGCATCTCAG 


4001 


CGTTTTTTTG 


TTTTTAATTG 


TATTTAGTTA 


TGGCCTATAC 


ACTATTTGTG 


4051 


AGCAAAGGTG 


ATCGTTTTCT 


GTTTGAGATT 


TTTATCTCTT 


GATTCTTCAA 


4101 


AAGCATTCTG 


AGAAGGTGAG 


ATAAGCCCTG 


AGTCTCAGCT 


ACCTAAGAAA 


4151 


AACCTGGATG 


TCACTGGCCA 


CTGAGGAGCT 


TTGTTTCAAC 


CAAGTCATGT 


4201 


GCATTTCCAC 


GTCAACAGAA 


TTGTTTATTG 


TGACAGTTAT 


ATCTGTTGTC 


4251 


CCTTTGACCT 


TGTTTCTTGA 


AGGTTTCCTC 


GTCCCTGGGC 


AATTCCGCAT 


4301 


TTAATTCATG 


GTATTCAGGA 


TTACATGCAT 


GTTTGGTTAA 


ACCCATGAGA 


4351 


TTCATTCAGT 


TAAAAATCCA 


GATGGCGAAT 


GACCAGCAGA 


TTCAAATCTA 


4401 


TGGTGGTTTG 


ACCTTTAGAG 


AGTTGCTTTA 


CGTGGCCTGT 


TTCAACACAG 


4451 


ACCCACCCAG 


AGCCCTCCTG 


CCCTCCTTCC 


GCGGGGGCTT 


TCTCATGGCT 


4501 


GTCCTTCAGG 


GTCTTCCTGA 


AATGCAGTGG 


TCGTTACGCT 


CCACCAAGAA


4551 


AGCAGGAAAC 


CTGTGGTATG 


AAGCCAGACC 


TCCCCGGCGG 


GCCTCAGGGA 


4601 


ACAGAATGAT 


CAGACCTTTG 


AATGATTCTA 


ATTTTTAAGC 


AAAATATTAT 


4651 


TTTATGAAAG 


GTTTACATTG 


TCAAAGTGAT 


GAATATGGAA 


TATCCAATCC 


4701 


TGTGCTGCTA 


TCCTGCCAAA 


ATCATTTTAA 


TGGAGTCAGT 


TTGCAGTATG 


4751 


CTCCACGTGG 


TAAGATCCTC 


CAAGCTGCTT 


TAGAAGTAAC 


AATGAAGAAC 


4801 


GTGGACGTTT 


TTAATATAAA 


GCCTGTTTTG 


TCTTTTGTTG 


TTGTTCAAAC 



Fig. 7D 



4851 GGGATTCACA GAGTATTTGA AAAATGTATA TATATTAAGA GGTCACGGGG 

4901 GCTAATTGCT AGCTGGCTGC CTTTTGCTGT GGGGTTTTGT TACCTGGTTT 

4951 TAATAACAGT AAATGTGCCC AGCCTCTTGG CCCCAGAACT GTACAGTATT 

5001 GTGGCTGCAC TTGCTCTAAG AGTAGTTGAT GTTGCATTTT CCTTATTGTT

5051 AAAAACATGT TAGAAGCAAT GAATGTATAT AAAAGC
i n3400

I 20 
ATG ACA CGC TGC ACG GCG GAC AAC TCG CTG ACG AAT CCG GCG TAT CGG CGA CGA ACG ATG 
MTRCTADNSLTNPAYRRRTM 



40 

GCG ACT GGC GAG ATG AAG GAG TTT CTG GGG ATA AAA GGC ACA GAG CCC ACC GAT TTT GGA 
ATGEMKEFLGIKGTEPTDFG 

T n2812 Q46Amber

J 60 

ATC AAT AGT GAT GCT CAG GAC TTG CCA TCA CCG AGT AGG CAG GCT TCG ACG CGA AGA ATG 
INSDAQDLPSPSRQASTRRM 



A n3377 E74K 

TCC ATC GGA GAG TCA ATT GAT GGA AAA ATC AAT GAT TGG GAA GAG CCA AGG CTT 
SIGESIDGKINDWEEPRL 



80 



GAT ATC 
D I 



100 



GAG GGA TTT GTG GTC GAC TAT TTC ACG CAC CGA ATC CGG CAA AAC GGA ATG GAA TGG TTT 
E G F V V DYFTHRIRQNGMEWF 



BH4 

GGA GCA CCG GGA TTG CCG TGT GGA GTG CAA CCG GAG CAC GAA ATG 
GAPGLPCGVQPEHEM 



120 



ATG CGA GTT ATG GGA 
M R V M G 



BH3 



ACG ATA TTC GAG 
T I F E 



140 

AAG AAG CAC GCG GAA AAT TTT GAG ACC TTC TGT GAG CAG CTG CTC GCA 
KKHAENFETF CEQLLA 



A n1653 Y149N



n2077 Q160Amber T 



GTG CCC AGA ATC TCA TTT TCA CTG TAT CAG GAT GTG GTT CGG ACG GTT GGA AAT GCA CAG 



R 



F S L Y 
n1950 G169E A



D V V R T V 
n3407 splice acceptor 



N 



ACA GAT CAA TGT CCA ATG 
T D Q C P M 



TCT TAT GGA CGT TTG ATA GGT CTA ATC TCG TTC GGC GGT 
SYGRLIGLISFGG 



180 
TTC 
F 

200 

GTA GCT GCA AAA ATG ATG GAA TCC GTG GAA CTG CAG GGA CAA GTG CGA AAC CTC TTC GTT 



BH1 



M M E 



E 



R N 



TAC ACA TCG CTG TTC ATC AAA ACG CGG ATC CGC AAC AAC 
YTSLFIKTRIRNN 



220 



TGG AAG GAA CAC AAT CGG AGC 
W K E H N R S 



BH2 



TGG GAC GAC TTC 
W D D F 



M 



M 



E 



240 

ATG ACA CTC GGA AAA CAA ATG AAA GAG GAC TAC GAA CGA GCA GAA GCT 

E R A E A 

260 

GGC GCT GGA GTA ACA 
G A G V T 



GAA AAA GTG GGA CGC CGG AAG CAG AAC AGA CGG TGG TCG ATG ATT 
EKVGRRKQNRRWSMI 



280 

GCT GGA GCC ATT GGA ATC GTT GGA GTC GTC GTG TGT GGG CGG ATG ATG TTC AGC TTG AAG 
AGAIGIVGVVVCGRMMFSLK 



Fig. 11 
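
The codon rows of Fig. 11 are each paired with a one-letter amino-acid row. As a minimal sketch for checking that pairing, the Python below translates the first codon row with the standard genetic code. It assumes the standard code applies; the names `CODON_TABLE`, `translate` and `row` are illustrative and do not come from the figure.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(codons: str) -> str:
    """Translate a DNA string (whitespace ignored) codon by codon.

    Stop codons are rendered as '*'; a trailing partial codon is dropped.
    """
    seq = "".join(codons.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First codon row of Fig. 11 (residues 1-20), copied from the figure.
row = ("ATG ACA CGC TGC ACG GCG GAC AAC TCG CTG "
       "ACG AAT CCG GCG TAT CGG CGA CGA ACG ATG")
print(translate(row))  # MTRCTADNSLTNPAYRRRTM
```

The same function can be applied to the other codon rows of the figure to spot OCR damage, since a mismatch between the translated row and the printed amino-acid row points to a misread base or residue.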



